Towards Expressive Speech Synthesis in English on a Robotic Platform
نویسندگان
چکیده
Affect influences speech, not only in the words we choose, but in the way we say them. This paper reviews the research on vocal correlates in the expression of affect and examines the ability of currently available major text-to-speech (TTS) systems to synthesize expressive speech for an emotional robot guide. Speech features discussed include pitch, duration, loudness, spectral structure, and voice quality. TTS systems are examined as to their ability to control the features needed for synthesizing expressive speech: pitch, duration, loudness, and voice quality. The OpenMARY system is recommended since it provides the highest amount of control over speech production as well as the ability to work with a sophisticated intonation model. OpenMARY is being actively developed, is supported on our current Linux platform, and provides timing information for talking heads such as our current robot face.
منابع مشابه
Towards a flexible platform for voice accent and expression selection on a Healthcare Robot
In the application of robots in healthcare, where there is a requirement to communicate vocally with non-expert users, a capacity to generate intelligible and expressive speech is needed. The Festival Speech Synthesis System is used as a framework for speech generation on our healthcare robot. Expression is added to speech by modifying mean pitch and pitch range parameters of a statistical mode...
متن کاملDiscourse Structures of Condolence Speech Act
Condolence is part of Austin’s expressive speech act and is related to Searle’s behabitives illocutionary act. Although a theoretically sound issue in pragmatics, condolence speech act has not been investigated as much as other speech acts in discourse-related studies. This paper aims at investigating interjections and intensifiers while performing condolence speech act among Persian and Englis...
متن کاملCategorizing expressive speech acts in the pragmatically annotated SPICE Ireland corpus
Expressive speech acts are one of the five basic categories of speech acts identified by Searle (1976). Expressives remain underresearched, though select categories of expressive speech acts, especially offering thanks and compliments, have received more extensive attention. An overall classification of expressive speech acts on the basis of corpus data has not yet been carried out. The current...
متن کاملWinkTalk: a demonstration of a multimodal speech synthesis platform linking facial expressions to expressive synthetic voices
This paper describes a demonstration of the WinkTalk system, which is a speech synthesis platform using expressive synthetic voices. With the help of a webcamera and facial expression analysis, the system allows the user to control the expressive features of the synthetic speech for a particular utterance with their facial expressions. Based on a personalised mapping between three expressive sy...
متن کاملExpressive Speech Recognition and Synthesis as Enabling Technologies for Affective Robot-Child Communication
This paper presents our recent and current work on expressive speech synthesis and recognition as enabling technologies for affective robot-child interaction. We show that current expression recognition systems could be used to discriminate between several archetypical emotions, but also that the old adage ”there’s no data like more data” is more than ever valid in this field. A new speech synt...
متن کامل